Serveur d'exploration SRAS

Attention, ce site est en cours de développement !
Attention, site généré par des moyens informatiques à partir de corpus bruts.
Les informations ne sont donc pas validées.

ZCURVE_V: a new self-training system for recognizing protein-coding genes in viral and phage genomes

Identifieur interne : 001011 ( Pmc/Checkpoint ); précédent : 001010; suivant : 001012

ZCURVE_V: a new self-training system for recognizing protein-coding genes in viral and phage genomes

Auteurs : Feng-Biao Guo ; Chun-Ting Zhang

Source :

RBID : PMC:1352377

Abstract

Background

It necessary to use highly accurate and statistics-based systems for viral and phage genome annotations. The GeneMark systems for gene-finding in virus and phage genomes suffer from some basic drawbacks. This paper puts forward an alternative approach for viral and phage gene-finding to improve the quality of annotations, particularly for newly sequenced genomes.

Results

The new system ZCURVE_V has been run for 979 viral and 212 phage genomes, respectively, and satisfactory results are obtained. To have a fair comparison with the currently available software of similar function, GeneMark, a total of 30 viral genomes that have not been annotated by GeneMark are selected to be tested. Consequently, the average specificity of both systems is well matched, however the average sensitivity of ZCURVE_V for smaller viral genomes (< 100 kb), which constitute the main parts of viral genomes sequenced so far, is higher than that of GeneMark. Additionally, for the genome of Amsacta moorei entomopoxvirus, probably with the lowest genomic GC content among the sequenced organisms, the accuracy of ZCURVE_V is much better than that of GeneMark, because the later predicts hundreds of false-positive genes. ZCURVE_V is also used to analyze well-studied genomes, such as HIV-1, HBV and SARS-CoV. Accordingly, the performance of ZCURVE_V is generally better than that of GeneMark. Finally, ZCURVE_V may be downloaded and run locally, particularly facilitating its utilization, whereas GeneMark is not downloadable. Based on the above comparison, it is suggested that ZCURVE_V may serve as a preferred gene-finding tool for viral and phage genomes newly sequenced. However, it is also shown that the joint application of both systems, ZCURVE_V and GeneMark, leads to better gene-finding results. The system ZCURVE_V is freely available at: http://tubic.tju.edu.cn/Zcurve_V/.

Conclusion

ZCURVE_V may serve as a preferred gene-finding tool used for viral and phage genomes, especially for anonymous viral and phage genomes newly sequenced.

Electronic supplementary material

The online version of this article (doi:10.1186/1471-2105-7-9) contains supplementary material, which is available to authorized users.


Url:
DOI: 10.1186/1471-2105-7-9
PubMed: 16401352
PubMed Central: 1352377


Affiliations:


Links toward previous steps (curation, corpus...)


Links to Exploration step

PMC:1352377

Le document en format XML

<record>
<TEI>
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="en">ZCURVE_V: a new self-training system for recognizing protein-coding genes in viral and phage genomes</title>
<author>
<name sortKey="Guo, Feng Biao" sort="Guo, Feng Biao" uniqKey="Guo F" first="Feng-Biao" last="Guo">Feng-Biao Guo</name>
<affiliation>
<nlm:aff id="Aff1"></nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Zhang, Chun Ting" sort="Zhang, Chun Ting" uniqKey="Zhang C" first="Chun-Ting" last="Zhang">Chun-Ting Zhang</name>
<affiliation>
<nlm:aff id="Aff1"></nlm:aff>
</affiliation>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">PMC</idno>
<idno type="pmid">16401352</idno>
<idno type="pmc">1352377</idno>
<idno type="url">http://www.ncbi.nlm.nih.gov/pmc/articles/PMC1352377</idno>
<idno type="RBID">PMC:1352377</idno>
<idno type="doi">10.1186/1471-2105-7-9</idno>
<date when="2006">2006</date>
<idno type="wicri:Area/Pmc/Corpus">000272</idno>
<idno type="wicri:explorRef" wicri:stream="Pmc" wicri:step="Corpus" wicri:corpus="PMC">000272</idno>
<idno type="wicri:Area/Pmc/Curation">000272</idno>
<idno type="wicri:explorRef" wicri:stream="Pmc" wicri:step="Curation">000272</idno>
<idno type="wicri:Area/Pmc/Checkpoint">001011</idno>
<idno type="wicri:explorRef" wicri:stream="Pmc" wicri:step="Checkpoint">001011</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title xml:lang="en" level="a" type="main">ZCURVE_V: a new self-training system for recognizing protein-coding genes in viral and phage genomes</title>
<author>
<name sortKey="Guo, Feng Biao" sort="Guo, Feng Biao" uniqKey="Guo F" first="Feng-Biao" last="Guo">Feng-Biao Guo</name>
<affiliation>
<nlm:aff id="Aff1"></nlm:aff>
</affiliation>
</author>
<author>
<name sortKey="Zhang, Chun Ting" sort="Zhang, Chun Ting" uniqKey="Zhang C" first="Chun-Ting" last="Zhang">Chun-Ting Zhang</name>
<affiliation>
<nlm:aff id="Aff1"></nlm:aff>
</affiliation>
</author>
</analytic>
<series>
<title level="j">BMC Bioinformatics</title>
<idno type="eISSN">1471-2105</idno>
<imprint>
<date when="2006">2006</date>
</imprint>
</series>
</biblStruct>
</sourceDesc>
</fileDesc>
<profileDesc>
<textClass></textClass>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en">
<sec>
<title>Background</title>
<p>It necessary to use highly accurate and statistics-based systems for viral and phage genome annotations. The GeneMark systems for gene-finding in virus and phage genomes suffer from some basic drawbacks. This paper puts forward an alternative approach for viral and phage gene-finding to improve the quality of annotations, particularly for newly sequenced genomes.</p>
</sec>
<sec>
<title>Results</title>
<p>The new system ZCURVE_V has been run for 979 viral and 212 phage genomes, respectively, and satisfactory results are obtained. To have a fair comparison with the currently available software of similar function, GeneMark, a total of 30 viral genomes that have not been annotated by GeneMark are selected to be tested. Consequently, the average specificity of both systems is well matched, however the average sensitivity of ZCURVE_V for smaller viral genomes (< 100 kb), which constitute the main parts of viral genomes sequenced so far, is higher than that of GeneMark. Additionally, for the genome of
<italic>Amsacta moorei</italic>
entomopoxvirus, probably with the lowest genomic GC content among the sequenced organisms, the accuracy of ZCURVE_V is much better than that of GeneMark, because the later predicts hundreds of false-positive genes. ZCURVE_V is also used to analyze well-studied genomes, such as HIV-1, HBV and SARS-CoV. Accordingly, the performance of ZCURVE_V is generally better than that of GeneMark. Finally, ZCURVE_V may be downloaded and run locally, particularly facilitating its utilization, whereas GeneMark is not downloadable. Based on the above comparison, it is suggested that ZCURVE_V may serve as a preferred gene-finding tool for viral and phage genomes newly sequenced. However, it is also shown that the joint application of both systems, ZCURVE_V and GeneMark, leads to better gene-finding results. The system ZCURVE_V is freely available at:
<ext-link ext-link-type="uri" xlink:href="http://tubic.tju.edu.cn/Zcurve_V/">http://tubic.tju.edu.cn/Zcurve_V/</ext-link>
.</p>
</sec>
<sec>
<title>Conclusion</title>
<p>ZCURVE_V may serve as a preferred gene-finding tool used for viral and phage genomes, especially for anonymous viral and phage genomes newly sequenced.</p>
</sec>
<sec>
<title>Electronic supplementary material</title>
<p>The online version of this article (doi:10.1186/1471-2105-7-9) contains supplementary material, which is available to authorized users.</p>
</sec>
</div>
</front>
<back>
<div1 type="bibliography">
<listBibl>
<biblStruct>
<analytic>
<author>
<name sortKey="Besemer, J" uniqKey="Besemer J">J Besemer</name>
</author>
<author>
<name sortKey="Lomsadze, A" uniqKey="Lomsadze A">A Lomsadze</name>
</author>
<author>
<name sortKey="Borodovsky, M" uniqKey="Borodovsky M">M Borodovsky</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Salzberg, Sl" uniqKey="Salzberg S">SL Salzberg</name>
</author>
<author>
<name sortKey="Delcher, Al" uniqKey="Delcher A">AL Delcher</name>
</author>
<author>
<name sortKey="Kasif, S" uniqKey="Kasif S">S Kasif</name>
</author>
<author>
<name sortKey="White, O" uniqKey="White O">O White</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Delcher, Al" uniqKey="Delcher A">AL Delcher</name>
</author>
<author>
<name sortKey="Harmon, D" uniqKey="Harmon D">D Harmon</name>
</author>
<author>
<name sortKey="Kasif, S" uniqKey="Kasif S">S Kasif</name>
</author>
<author>
<name sortKey="White, O" uniqKey="White O">O White</name>
</author>
<author>
<name sortKey="Salzberg, Sl" uniqKey="Salzberg S">SL Salzberg</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Guo, Fb" uniqKey="Guo F">FB Guo</name>
</author>
<author>
<name sortKey="Ou, Hy" uniqKey="Ou H">HY Ou</name>
</author>
<author>
<name sortKey="Zhang, Ct" uniqKey="Zhang C">CT Zhang</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Badger, Jh" uniqKey="Badger J">JH Badger</name>
</author>
<author>
<name sortKey="Olsen, Gj" uniqKey="Olsen G">GJ Olsen</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Frishman, D" uniqKey="Frishman D">D Frishman</name>
</author>
<author>
<name sortKey="Mironov, A" uniqKey="Mironov A">A Mironov</name>
</author>
<author>
<name sortKey="Mewes, Hw" uniqKey="Mewes H">HW Mewes</name>
</author>
<author>
<name sortKey="Gelfand, M" uniqKey="Gelfand M">M Gelfand</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Altschul, Sf" uniqKey="Altschul S">SF Altschul</name>
</author>
<author>
<name sortKey="Madden, Tl" uniqKey="Madden T">TL Madden</name>
</author>
<author>
<name sortKey="Schaffer, Aa" uniqKey="Schaffer A">AA Schaffer</name>
</author>
<author>
<name sortKey="Zhang, J" uniqKey="Zhang J">J Zhang</name>
</author>
<author>
<name sortKey="Zhang, Z" uniqKey="Zhang Z">Z Zhang</name>
</author>
<author>
<name sortKey="Miller, W" uniqKey="Miller W">W Miller</name>
</author>
<author>
<name sortKey="Lipman, Dj" uniqKey="Lipman D">DJ Lipman</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Besemer, J" uniqKey="Besemer J">J Besemer</name>
</author>
<author>
<name sortKey="Borodovsky, M" uniqKey="Borodovsky M">M Borodovsky</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Mills, R" uniqKey="Mills R">R Mills</name>
</author>
<author>
<name sortKey="Rozanov, M" uniqKey="Rozanov M">M Rozanov</name>
</author>
<author>
<name sortKey="Lomsadze, A" uniqKey="Lomsadze A">A Lomsadze</name>
</author>
<author>
<name sortKey="Tatusova, T" uniqKey="Tatusova T">T Tatusova</name>
</author>
<author>
<name sortKey="Borodovsky, M" uniqKey="Borodovsky M">M Borodovsky</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Tech, M" uniqKey="Tech M">M Tech</name>
</author>
<author>
<name sortKey="Merkl, R" uniqKey="Merkl R">R Merkl</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Chen, Ll" uniqKey="Chen L">LL Chen</name>
</author>
<author>
<name sortKey="Ou, Hy" uniqKey="Ou H">HY Ou</name>
</author>
<author>
<name sortKey="Zhang, R" uniqKey="Zhang R">R Zhang</name>
</author>
<author>
<name sortKey="Zhang, Ct" uniqKey="Zhang C">CT Zhang</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Zhang, Ct" uniqKey="Zhang C">CT Zhang</name>
</author>
<author>
<name sortKey="Zhang, R" uniqKey="Zhang R">R Zhang</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Burset, M" uniqKey="Burset M">M Burset</name>
</author>
<author>
<name sortKey="Guig, R" uniqKey="Guig R">R Guigó</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Bawden, Al" uniqKey="Bawden A">AL Bawden</name>
</author>
<author>
<name sortKey="Glassberg, Kj" uniqKey="Glassberg K">KJ Glassberg</name>
</author>
<author>
<name sortKey="Diggans, J" uniqKey="Diggans J">J Diggans</name>
</author>
<author>
<name sortKey="Shaw, R" uniqKey="Shaw R">R Shaw</name>
</author>
<author>
<name sortKey="Farmerie, W" uniqKey="Farmerie W">W Farmerie</name>
</author>
<author>
<name sortKey="Moyer, Rw" uniqKey="Moyer R">RW Moyer</name>
</author>
</analytic>
</biblStruct>
<biblStruct></biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Ksiazek, Tg" uniqKey="Ksiazek T">TG Ksiazek</name>
</author>
<author>
<name sortKey="Erdman, D" uniqKey="Erdman D">D Erdman</name>
</author>
<author>
<name sortKey="Goldsmith, Cs" uniqKey="Goldsmith C">CS Goldsmith</name>
</author>
<author>
<name sortKey="Zaki, Sr" uniqKey="Zaki S">SR Zaki</name>
</author>
<author>
<name sortKey="Peret, T" uniqKey="Peret T">T Peret</name>
</author>
<author>
<name sortKey="Emery, S" uniqKey="Emery S">S Emery</name>
</author>
<author>
<name sortKey="Tong, S" uniqKey="Tong S">S Tong</name>
</author>
<author>
<name sortKey="Urbani, C" uniqKey="Urbani C">C Urbani</name>
</author>
<author>
<name sortKey="Comer, Ja" uniqKey="Comer J">JA Comer</name>
</author>
<author>
<name sortKey="Lim, W" uniqKey="Lim W">W Lim</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Marra, Ma" uniqKey="Marra M">MA Marra</name>
</author>
<author>
<name sortKey="Jones, Sj" uniqKey="Jones S">SJ Jones</name>
</author>
<author>
<name sortKey="Astell, Cr" uniqKey="Astell C">CR Astell</name>
</author>
<author>
<name sortKey="Holt, Ra" uniqKey="Holt R">RA Holt</name>
</author>
<author>
<name sortKey="Brooks Wilson, A" uniqKey="Brooks Wilson A">A Brooks-Wilson</name>
</author>
<author>
<name sortKey="Butterfield, Ys" uniqKey="Butterfield Y">YS Butterfield</name>
</author>
<author>
<name sortKey="Khattra, J" uniqKey="Khattra J">J Khattra</name>
</author>
<author>
<name sortKey="Asano, Jk" uniqKey="Asano J">JK Asano</name>
</author>
<author>
<name sortKey="Barber, Sa" uniqKey="Barber S">SA Barber</name>
</author>
<author>
<name sortKey="Chan, Sy" uniqKey="Chan S">SY Chan</name>
</author>
<author>
<name sortKey="Cloutier, A" uniqKey="Cloutier A">A Cloutier</name>
</author>
<author>
<name sortKey="Coughlin, Sm" uniqKey="Coughlin S">SM Coughlin</name>
</author>
<author>
<name sortKey="Freeman, D" uniqKey="Freeman D">D Freeman</name>
</author>
<author>
<name sortKey="Girn, N" uniqKey="Girn N">N Girn</name>
</author>
<author>
<name sortKey="Griffith, Ol" uniqKey="Griffith O">OL Griffith</name>
</author>
<author>
<name sortKey="Leach, Sr" uniqKey="Leach S">SR Leach</name>
</author>
<author>
<name sortKey="Mayo, M" uniqKey="Mayo M">M Mayo</name>
</author>
<author>
<name sortKey="Mcdonald, H" uniqKey="Mcdonald H">H McDonald</name>
</author>
<author>
<name sortKey="Montgomery, Sb" uniqKey="Montgomery S">SB Montgomery</name>
</author>
<author>
<name sortKey="Pandoh, Pk" uniqKey="Pandoh P">PK Pandoh</name>
</author>
<author>
<name sortKey="Petrescu, As" uniqKey="Petrescu A">AS Petrescu</name>
</author>
<author>
<name sortKey="Robertson, Ag" uniqKey="Robertson A">AG Robertson</name>
</author>
<author>
<name sortKey="Schein, Je" uniqKey="Schein J">JE Schein</name>
</author>
<author>
<name sortKey="Siddiqui, A" uniqKey="Siddiqui A">A Siddiqui</name>
</author>
<author>
<name sortKey="Smailus, De" uniqKey="Smailus D">DE Smailus</name>
</author>
<author>
<name sortKey="Stott, Jm" uniqKey="Stott J">JM Stott</name>
</author>
<author>
<name sortKey="Yang, Gs" uniqKey="Yang G">GS Yang</name>
</author>
<author>
<name sortKey="Plummer, F" uniqKey="Plummer F">F Plummer</name>
</author>
<author>
<name sortKey="Andonov, A" uniqKey="Andonov A">A Andonov</name>
</author>
<author>
<name sortKey="Artsob, H" uniqKey="Artsob H">H Artsob</name>
</author>
<author>
<name sortKey="Bastien, N" uniqKey="Bastien N">N Bastien</name>
</author>
<author>
<name sortKey="Bernard, K" uniqKey="Bernard K">K Bernard</name>
</author>
<author>
<name sortKey="Booth, Tf" uniqKey="Booth T">TF Booth</name>
</author>
<author>
<name sortKey="Bowness, D" uniqKey="Bowness D">D Bowness</name>
</author>
<author>
<name sortKey="Czub, M" uniqKey="Czub M">M Czub</name>
</author>
<author>
<name sortKey="Drebot, M" uniqKey="Drebot M">M Drebot</name>
</author>
<author>
<name sortKey="Fernando, L" uniqKey="Fernando L">L Fernando</name>
</author>
<author>
<name sortKey="Flick, R" uniqKey="Flick R">R Flick</name>
</author>
<author>
<name sortKey="Garbutt, M" uniqKey="Garbutt M">M Garbutt</name>
</author>
<author>
<name sortKey="Gray, M" uniqKey="Gray M">M Gray</name>
</author>
<author>
<name sortKey="Grolla, A" uniqKey="Grolla A">A Grolla</name>
</author>
<author>
<name sortKey="Jones, S" uniqKey="Jones S">S Jones</name>
</author>
<author>
<name sortKey="Feldmann, H" uniqKey="Feldmann H">H Feldmann</name>
</author>
<author>
<name sortKey="Meyers, A" uniqKey="Meyers A">A Meyers</name>
</author>
<author>
<name sortKey="Kabani, A" uniqKey="Kabani A">A Kabani</name>
</author>
<author>
<name sortKey="Li, Y" uniqKey="Li Y">Y Li</name>
</author>
<author>
<name sortKey="Normand, S" uniqKey="Normand S">S Normand</name>
</author>
<author>
<name sortKey="Stroher, U" uniqKey="Stroher U">U Stroher</name>
</author>
<author>
<name sortKey="Tipples, Ga" uniqKey="Tipples G">GA Tipples</name>
</author>
<author>
<name sortKey="Tyler, S" uniqKey="Tyler S">S Tyler</name>
</author>
<author>
<name sortKey="Vogrig, R" uniqKey="Vogrig R">R Vogrig</name>
</author>
<author>
<name sortKey="Ward, D" uniqKey="Ward D">D Ward</name>
</author>
<author>
<name sortKey="Watson, B" uniqKey="Watson B">B Watson</name>
</author>
<author>
<name sortKey="Brunham, Rc" uniqKey="Brunham R">RC Brunham</name>
</author>
<author>
<name sortKey="Krajden, M" uniqKey="Krajden M">M Krajden</name>
</author>
<author>
<name sortKey="Petric, M" uniqKey="Petric M">M Petric</name>
</author>
<author>
<name sortKey="Skowronski, Dm" uniqKey="Skowronski D">DM Skowronski</name>
</author>
<author>
<name sortKey="Upton, C" uniqKey="Upton C">C Upton</name>
</author>
<author>
<name sortKey="Roper, Rl" uniqKey="Roper R">RL Roper</name>
</author>
</analytic>
</biblStruct>
<biblStruct></biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Mchardy, Ac" uniqKey="Mchardy A">AC McHardy</name>
</author>
<author>
<name sortKey="Goesmann, A" uniqKey="Goesmann A">A Goesmann</name>
</author>
<author>
<name sortKey="Puhler, A" uniqKey="Puhler A">A Puhler</name>
</author>
<author>
<name sortKey="Meyer, F" uniqKey="Meyer F">F Meyer</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Benson, Da" uniqKey="Benson D">DA Benson</name>
</author>
<author>
<name sortKey="Karsch Mizrachi, I" uniqKey="Karsch Mizrachi I">I Karsch-Mizrachi</name>
</author>
<author>
<name sortKey="Lipman, Dj" uniqKey="Lipman D">DJ Lipman</name>
</author>
<author>
<name sortKey="Ostell, J" uniqKey="Ostell J">J Ostell</name>
</author>
<author>
<name sortKey="Wheeler, Dl" uniqKey="Wheeler D">DL Wheeler</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Pruitt, Kd" uniqKey="Pruitt K">KD Pruitt</name>
</author>
<author>
<name sortKey="Tatusova, T" uniqKey="Tatusova T">T Tatusova</name>
</author>
<author>
<name sortKey="Maglott, Dr" uniqKey="Maglott D">DR Maglott</name>
</author>
</analytic>
</biblStruct>
<biblStruct></biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Trifonov, En" uniqKey="Trifonov E">EN Trifonov</name>
</author>
</analytic>
</biblStruct>
<biblStruct></biblStruct>
</listBibl>
</div1>
</back>
</TEI>
<pmc article-type="research-article">
<pmc-dir>properties open_access</pmc-dir>
<front>
<journal-meta>
<journal-id journal-id-type="nlm-ta">BMC Bioinformatics</journal-id>
<journal-id journal-id-type="iso-abbrev">BMC Bioinformatics</journal-id>
<journal-title-group>
<journal-title>BMC Bioinformatics</journal-title>
</journal-title-group>
<issn pub-type="epub">1471-2105</issn>
<publisher>
<publisher-name>BioMed Central</publisher-name>
<publisher-loc>London</publisher-loc>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="pmid">16401352</article-id>
<article-id pub-id-type="pmc">1352377</article-id>
<article-id pub-id-type="publisher-id">748</article-id>
<article-id pub-id-type="doi">10.1186/1471-2105-7-9</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Software</subject>
</subj-group>
</article-categories>
<title-group>
<article-title>ZCURVE_V: a new self-training system for recognizing protein-coding genes in viral and phage genomes</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author">
<name>
<surname>Guo</surname>
<given-names>Feng-Biao</given-names>
</name>
<address>
<email>guofengbiao@eyou.com</email>
</address>
<xref ref-type="aff" rid="Aff1"></xref>
</contrib>
<contrib contrib-type="author" corresp="yes">
<name>
<surname>Zhang</surname>
<given-names>Chun-Ting</given-names>
</name>
<address>
<email>ctzhang@tju.edu.cn</email>
</address>
<xref ref-type="aff" rid="Aff1"></xref>
</contrib>
<aff id="Aff1">
<institution-wrap>
<institution-id institution-id-type="GRID">grid.33763.32</institution-id>
<institution-id institution-id-type="ISNI">0000000417612484</institution-id>
<institution>Department of Physics,</institution>
<institution>Tianjin University,</institution>
</institution-wrap>
Tianjin, 300072 China</aff>
</contrib-group>
<pub-date pub-type="epub">
<day>10</day>
<month>1</month>
<year>2006</year>
</pub-date>
<pub-date pub-type="pmc-release">
<day>10</day>
<month>1</month>
<year>2006</year>
</pub-date>
<pub-date pub-type="collection">
<year>2006</year>
</pub-date>
<volume>7</volume>
<elocation-id>9</elocation-id>
<history>
<date date-type="received">
<day>20</day>
<month>7</month>
<year>2005</year>
</date>
<date date-type="accepted">
<day>10</day>
<month>1</month>
<year>2006</year>
</date>
</history>
<permissions>
<copyright-statement>© Guo and Zhang; licensee BioMed Central Ltd. 2006</copyright-statement>
<license>
<license-p>This article is made available via the PMC Open Access Subset for unrestricted research re-use and secondary analysis in any form or by any means with acknowledgement of the original source. These permissions are granted for the duration of the World Health Organization (WHO) declaration of COVID-19 as a global pandemic.</license-p>
</license>
</permissions>
<abstract id="Abs1">
<sec>
<title>Background</title>
<p>It necessary to use highly accurate and statistics-based systems for viral and phage genome annotations. The GeneMark systems for gene-finding in virus and phage genomes suffer from some basic drawbacks. This paper puts forward an alternative approach for viral and phage gene-finding to improve the quality of annotations, particularly for newly sequenced genomes.</p>
</sec>
<sec>
<title>Results</title>
<p>The new system ZCURVE_V has been run for 979 viral and 212 phage genomes, respectively, and satisfactory results are obtained. To have a fair comparison with the currently available software of similar function, GeneMark, a total of 30 viral genomes that have not been annotated by GeneMark are selected to be tested. Consequently, the average specificity of both systems is well matched, however the average sensitivity of ZCURVE_V for smaller viral genomes (< 100 kb), which constitute the main parts of viral genomes sequenced so far, is higher than that of GeneMark. Additionally, for the genome of
<italic>Amsacta moorei</italic>
entomopoxvirus, probably with the lowest genomic GC content among the sequenced organisms, the accuracy of ZCURVE_V is much better than that of GeneMark, because the later predicts hundreds of false-positive genes. ZCURVE_V is also used to analyze well-studied genomes, such as HIV-1, HBV and SARS-CoV. Accordingly, the performance of ZCURVE_V is generally better than that of GeneMark. Finally, ZCURVE_V may be downloaded and run locally, particularly facilitating its utilization, whereas GeneMark is not downloadable. Based on the above comparison, it is suggested that ZCURVE_V may serve as a preferred gene-finding tool for viral and phage genomes newly sequenced. However, it is also shown that the joint application of both systems, ZCURVE_V and GeneMark, leads to better gene-finding results. The system ZCURVE_V is freely available at:
<ext-link ext-link-type="uri" xlink:href="http://tubic.tju.edu.cn/Zcurve_V/">http://tubic.tju.edu.cn/Zcurve_V/</ext-link>
.</p>
</sec>
<sec>
<title>Conclusion</title>
<p>ZCURVE_V may serve as a preferred gene-finding tool used for viral and phage genomes, especially for anonymous viral and phage genomes newly sequenced.</p>
</sec>
<sec>
<title>Electronic supplementary material</title>
<p>The online version of this article (doi:10.1186/1471-2105-7-9) contains supplementary material, which is available to authorized users.</p>
</sec>
</abstract>
<kwd-group xml:lang="en">
<title>Keywords</title>
<kwd>Viral Genome</kwd>
<kwd>Phage Genome</kwd>
<kwd>Joint Application</kwd>
<kwd>RefSeq Annotation</kwd>
<kwd>Cereal Yellow Dwarf Virus</kwd>
</kwd-group>
<custom-meta-group>
<custom-meta>
<meta-name>issue-copyright-statement</meta-name>
<meta-value>© The Author(s) 2006</meta-value>
</custom-meta>
</custom-meta-group>
</article-meta>
</front>
</pmc>
<affiliations>
<list></list>
<tree>
<noCountry>
<name sortKey="Guo, Feng Biao" sort="Guo, Feng Biao" uniqKey="Guo F" first="Feng-Biao" last="Guo">Feng-Biao Guo</name>
<name sortKey="Zhang, Chun Ting" sort="Zhang, Chun Ting" uniqKey="Zhang C" first="Chun-Ting" last="Zhang">Chun-Ting Zhang</name>
</noCountry>
</tree>
</affiliations>
</record>

Pour manipuler ce document sous Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Sante/explor/SrasV1/Data/Pmc/Checkpoint
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 001011 | SxmlIndent | more

Ou

HfdSelect -h $EXPLOR_AREA/Data/Pmc/Checkpoint/biblio.hfd -nk 001011 | SxmlIndent | more

Pour mettre un lien sur cette page dans le réseau Wicri

{{Explor lien
   |wiki=    Sante
   |area=    SrasV1
   |flux=    Pmc
   |étape=   Checkpoint
   |type=    RBID
   |clé=     PMC:1352377
   |texte=   ZCURVE_V: a new self-training system for recognizing protein-coding genes in viral and phage genomes
}}

Pour générer des pages wiki

HfdIndexSelect -h $EXPLOR_AREA/Data/Pmc/Checkpoint/RBID.i   -Sk "pubmed:16401352" \
       | HfdSelect -Kh $EXPLOR_AREA/Data/Pmc/Checkpoint/biblio.hfd   \
       | NlmPubMed2Wicri -a SrasV1 

Wicri

This area was generated with Dilib version V0.6.33.
Data generation: Tue Apr 28 14:49:16 2020. Site generation: Sat Mar 27 22:06:49 2021